Overview

Dataset statistics

Number of variables35
Number of observations1460
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory399.3 KiB
Average record size in memory280.1 B

Variable types

NUM29
CAT6

Reproduction

Analysis started2020-04-23 16:52:00.247952
Analysis finished2020-04-23 16:54:59.097400
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
MiscVal is highly skewed (γ1 = 24.47679419) Skewed
BsmtFinSF1 has 467 (32.0%) zeros Zeros
BsmtFinSF2 has 1293 (88.6%) zeros Zeros
BsmtUnfSF has 118 (8.1%) zeros Zeros
TotalBsmtSF has 37 (2.5%) zeros Zeros
2ndFlrSF has 829 (56.8%) zeros Zeros
LowQualFinSF has 1434 (98.2%) zeros Zeros
GarageCars has 81 (5.5%) zeros Zeros
GarageArea has 81 (5.5%) zeros Zeros
WoodDeckSF has 761 (52.1%) zeros Zeros
OpenPorchSF has 656 (44.9%) zeros Zeros
EnclosedPorch has 1252 (85.8%) zeros Zeros
3SsnPorch has 1436 (98.4%) zeros Zeros
ScreenPorch has 1344 (92.1%) zeros Zeros
PoolArea has 1453 (99.5%) zeros Zeros
MiscVal has 1408 (96.4%) zeros Zeros

Variables

Id
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count1460
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean730.5
Minimum1
Maximum1460
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1
5-th percentile73.95
Q1365.75
median730.5
Q31095.25
95-th percentile1387.05
Maximum1460
Range1459
Interquartile range (IQR)729.5

Descriptive statistics

Standard deviation421.6100094
Coefficient of variation (CV)0.577152648
Kurtosis-1.2
Mean730.5
Median Absolute Deviation (MAD)365
Skewness0
Sum1066530
Variance177755
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00e+00 1.46e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1460 1 0.1%
 
479 1 0.1%
 
481 1 0.1%
 
482 1 0.1%
 
483 1 0.1%
 
484 1 0.1%
 
485 1 0.1%
 
486 1 0.1%
 
487 1 0.1%
 
488 1 0.1%
 
Other values (1450) 1450 99.3%
 
ValueCountFrequency (%) 
1 1 0.1%
 
2 1 0.1%
 
3 1 0.1%
 
4 1 0.1%
 
5 1 0.1%
 
ValueCountFrequency (%) 
1460 1 0.1%
 
1459 1 0.1%
 
1458 1 0.1%
 
1457 1 0.1%
 
1456 1 0.1%
 

MSSubClass
Real number (ℝ≥0)

Distinct count15
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.89726027
Minimum20
Maximum190
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum20
5-th percentile20
Q120
median50
Q370
95-th percentile160
Maximum190
Range170
Interquartile range (IQR)50

Descriptive statistics

Standard deviation42.30057099
Coefficient of variation (CV)0.7434553226
Kurtosis1.580187965
Mean56.89726027
Median Absolute Deviation (MAD)31.28274536
Skewness1.407656747
Sum83070
Variance1789.338306
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 20. 25. 35. 42.5 47.5 ... 77.5 82.5 170. 185. 190. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20 536 36.7%
 
60 299 20.5%
 
50 144 9.9%
 
120 87 6.0%
 
30 69 4.7%
 
160 63 4.3%
 
70 60 4.1%
 
80 58 4.0%
 
90 52 3.6%
 
190 30 2.1%
 
Other values (5) 62 4.2%
 
ValueCountFrequency (%) 
20 536 36.7%
 
30 69 4.7%
 
40 4 0.3%
 
45 12 0.8%
 
50 144 9.9%
 
ValueCountFrequency (%) 
190 30 2.1%
 
180 10 0.7%
 
160 63 4.3%
 
120 87 6.0%
 
90 52 3.6%
 

LotArea
Real number (ℝ≥0)

Distinct count1073
Unique (%)73.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10516.82808
Minimum1300
Maximum215245
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1300
5-th percentile3311.7
Q17553.5
median9478.5
Q311601.5
95-th percentile17401.15
Maximum215245
Range213945
Interquartile range (IQR)4048

Descriptive statistics

Standard deviation9981.264932
Coefficient of variation (CV)0.949075601
Kurtosis203.243271
Mean10516.82808
Median Absolute Deviation (MAD)3758.813815
Skewness12.20768785
Sum15354569
Variance99625649.65
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1300. 3181. 3189. 5962.5 6020. ... 14875.5 18015. 26160. 55352. 215245. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7200 25 1.7%
 
9600 24 1.6%
 
6000 17 1.2%
 
10800 14 1.0%
 
9000 14 1.0%
 
8400 14 1.0%
 
1680 10 0.7%
 
7500 9 0.6%
 
8125 8 0.5%
 
9100 8 0.5%
 
Other values (1063) 1317 90.2%
 
ValueCountFrequency (%) 
1300 1 0.1%
 
1477 1 0.1%
 
1491 1 0.1%
 
1526 1 0.1%
 
1533 2 0.1%
 
ValueCountFrequency (%) 
215245 1 0.1%
 
164660 1 0.1%
 
159000 1 0.1%
 
115149 1 0.1%
 
70761 1 0.1%
 

OverallQual
Real number (ℝ≥0)

Distinct count10
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.099315068
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1
5-th percentile4
Q15
median6
Q37
95-th percentile8
Maximum10
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.382996547
Coefficient of variation (CV)0.2267462053
Kurtosis0.09629277836
Mean6.099315068
Median Absolute Deviation (MAD)1.098048414
Skewness0.2169439278
Sum8905
Variance1.912679448
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 2.5 3.5 4.5 6.5 7.5 8.5 10. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 397 27.2%
 
6 374 25.6%
 
7 319 21.8%
 
8 168 11.5%
 
4 116 7.9%
 
9 43 2.9%
 
3 20 1.4%
 
10 18 1.2%
 
2 3 0.2%
 
1 2 0.1%
 
ValueCountFrequency (%) 
1 2 0.1%
 
2 3 0.2%
 
3 20 1.4%
 
4 116 7.9%
 
5 397 27.2%
 
ValueCountFrequency (%) 
10 18 1.2%
 
9 43 2.9%
 
8 168 11.5%
 
7 319 21.8%
 
6 374 25.6%
 

OverallCond
Real number (ℝ≥0)

Distinct count9
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.575342466
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1
5-th percentile4
Q15
median5
Q36
95-th percentile8
Maximum9
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.112799337
Coefficient of variation (CV)0.1995930014
Kurtosis1.106413461
Mean5.575342466
Median Absolute Deviation (MAD)0.8890223306
Skewness0.6930674725
Sum8140
Variance1.238322364
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 2.5 3.5 4.5 5.5 7.5 9. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 821 56.2%
 
6 252 17.3%
 
7 205 14.0%
 
8 72 4.9%
 
4 57 3.9%
 
3 25 1.7%
 
9 22 1.5%
 
2 5 0.3%
 
1 1 0.1%
 
ValueCountFrequency (%) 
1 1 0.1%
 
2 5 0.3%
 
3 25 1.7%
 
4 57 3.9%
 
5 821 56.2%
 
ValueCountFrequency (%) 
9 22 1.5%
 
8 72 4.9%
 
7 205 14.0%
 
6 252 17.3%
 
5 821 56.2%
 

YearBuilt
Real number (ℝ≥0)

Distinct count112
Unique (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1971.267808
Minimum1872
Maximum2010
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1872
5-th percentile1916
Q11954
median1973
Q32000
95-th percentile2007
Maximum2010
Range138
Interquartile range (IQR)46

Descriptive statistics

Standard deviation30.20290404
Coefficient of variation (CV)0.01532156307
Kurtosis-0.4395519416
Mean1971.267808
Median Absolute Deviation (MAD)25.06722274
Skewness-0.6134611725
Sum2878051
Variance912.2154126
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1872. 1909. 1919.5 1920.5 1939.5 ... 1991.5 2002.5 2007.5 2009.5 2010. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2006 67 4.6%
 
2005 64 4.4%
 
2004 54 3.7%
 
2007 49 3.4%
 
2003 45 3.1%
 
1976 33 2.3%
 
1977 32 2.2%
 
1920 30 2.1%
 
1959 26 1.8%
 
1999 25 1.7%
 
Other values (102) 1035 70.9%
 
ValueCountFrequency (%) 
1872 1 0.1%
 
1875 1 0.1%
 
1880 4 0.3%
 
1882 1 0.1%
 
1885 2 0.1%
 
ValueCountFrequency (%) 
2010 1 0.1%
 
2009 18 1.2%
 
2008 23 1.6%
 
2007 49 3.4%
 
2006 67 4.6%
 

YearRemodAdd
Real number (ℝ≥0)

Distinct count61
Unique (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1984.865753
Minimum1950
Maximum2010
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1950
5-th percentile1950
Q11967
median1994
Q32004
95-th percentile2007
Maximum2010
Range60
Interquartile range (IQR)37

Descriptive statistics

Standard deviation20.64540681
Coefficient of variation (CV)0.01040141217
Kurtosis-1.272245192
Mean1984.865753
Median Absolute Deviation (MAD)18.62315256
Skewness-0.5035620027
Sum2897904
Variance426.2328223
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1950. 1950.5 1952.5 1975.5 1978.5 ... 1994.5 2001.5 2004.5 2007.5 2010. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1950 178 12.2%
 
2006 97 6.6%
 
2007 76 5.2%
 
2005 73 5.0%
 
2004 62 4.2%
 
2000 55 3.8%
 
2003 51 3.5%
 
2002 48 3.3%
 
2008 40 2.7%
 
1996 36 2.5%
 
Other values (51) 744 51.0%
 
ValueCountFrequency (%) 
1950 178 12.2%
 
1951 4 0.3%
 
1952 5 0.3%
 
1953 10 0.7%
 
1954 14 1.0%
 
ValueCountFrequency (%) 
2010 6 0.4%
 
2009 23 1.6%
 
2008 40 2.7%
 
2007 76 5.2%
 
2006 97 6.6%
 

BsmtFinSF1
Real number (ℝ≥0)

ZEROS
Distinct count637
Unique (%)43.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean443.639726
Minimum0
Maximum5644
Zeros467
Zeros (%)32.0%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median383.5
Q3712.25
95-th percentile1274
Maximum5644
Range5644
Interquartile range (IQR)712.25

Descriptive statistics

Standard deviation456.0980908
Coefficient of variation (CV)1.028082167
Kurtosis11.11823629
Mean443.639726
Median Absolute Deviation (MAD)367.3696735
Skewness1.685503072
Sum647714
Variance208025.4685
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 2.200e+01 2.450e+01 1.795e+02 7.895e+02 1.087e+03 1.475e+03 2.224e+03 5.644e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 467 32.0%
 
24 12 0.8%
 
16 9 0.6%
 
20 5 0.3%
 
686 5 0.3%
 
616 5 0.3%
 
936 5 0.3%
 
662 5 0.3%
 
428 4 0.3%
 
655 4 0.3%
 
Other values (627) 939 64.3%
 
ValueCountFrequency (%) 
0 467 32.0%
 
2 1 0.1%
 
16 9 0.6%
 
20 5 0.3%
 
24 12 0.8%
 
ValueCountFrequency (%) 
5644 1 0.1%
 
2260 1 0.1%
 
2188 1 0.1%
 
2096 1 0.1%
 
1904 1 0.1%
 

BsmtFinSF2
Real number (ℝ≥0)

ZEROS
Distinct count144
Unique (%)9.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.54931507
Minimum0
Maximum1474
Zeros1293
Zeros (%)88.6%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile396.2
Maximum1474
Range1474
Interquartile range (IQR)0

Descriptive statistics

Standard deviation161.3192728
Coefficient of variation (CV)3.465556315
Kurtosis20.11333755
Mean46.54931507
Median Absolute Deviation (MAD)82.53501407
Skewness4.255261109
Sum67962
Variance26023.90778
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 14. 178.5 183. 718. 1123.5 1474. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1293 88.6%
 
180 5 0.3%
 
374 3 0.2%
 
551 2 0.1%
 
93 2 0.1%
 
468 2 0.1%
 
147 2 0.1%
 
480 2 0.1%
 
539 2 0.1%
 
712 2 0.1%
 
Other values (134) 145 9.9%
 
ValueCountFrequency (%) 
0 1293 88.6%
 
28 1 0.1%
 
32 1 0.1%
 
35 1 0.1%
 
40 1 0.1%
 
ValueCountFrequency (%) 
1474 1 0.1%
 
1127 1 0.1%
 
1120 1 0.1%
 
1085 1 0.1%
 
1080 1 0.1%
 

BsmtUnfSF
Real number (ℝ≥0)

ZEROS
Distinct count780
Unique (%)53.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean567.240411
Minimum0
Maximum2336
Zeros118
Zeros (%)8.1%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1223
median477.5
Q3808
95-th percentile1468
Maximum2336
Range2336
Interquartile range (IQR)585

Descriptive statistics

Standard deviation441.8669553
Coefficient of variation (CV)0.7789765094
Kurtosis0.4749939878
Mean567.240411
Median Absolute Deviation (MAD)353.2816157
Skewness0.9202684528
Sum828171
Variance195246.4062
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 7. 74.5 441.5 817. 977.5 1500. 1818. 2336. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 118 8.1%
 
728 9 0.6%
 
384 8 0.5%
 
572 7 0.5%
 
600 7 0.5%
 
300 7 0.5%
 
440 6 0.4%
 
625 6 0.4%
 
280 6 0.4%
 
672 6 0.4%
 
Other values (770) 1280 87.7%
 
ValueCountFrequency (%) 
0 118 8.1%
 
14 1 0.1%
 
15 1 0.1%
 
23 2 0.1%
 
26 1 0.1%
 
ValueCountFrequency (%) 
2336 1 0.1%
 
2153 1 0.1%
 
2121 1 0.1%
 
2046 1 0.1%
 
2042 1 0.1%
 

TotalBsmtSF
Real number (ℝ≥0)

ZEROS
Distinct count721
Unique (%)49.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1057.429452
Minimum0
Maximum6110
Zeros37
Zeros (%)2.5%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile519.3
Q1795.75
median991.5
Q31298.25
95-th percentile1753
Maximum6110
Range6110
Interquartile range (IQR)502.5

Descriptive statistics

Standard deviation438.7053245
Coefficient of variation (CV)0.4148790481
Kurtosis13.25048328
Mean1057.429452
Median Absolute Deviation (MAD)321.2843732
Skewness1.524254549
Sum1543847
Variance192462.3617
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 52.5 366. 482.5 484. ... 1503. 1738. 2155.5 3203. 6110. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 37 2.5%
 
864 35 2.4%
 
672 17 1.2%
 
912 15 1.0%
 
1040 14 1.0%
 
816 13 0.9%
 
728 12 0.8%
 
768 12 0.8%
 
848 11 0.8%
 
780 11 0.8%
 
Other values (711) 1283 87.9%
 
ValueCountFrequency (%) 
0 37 2.5%
 
105 1 0.1%
 
190 1 0.1%
 
264 3 0.2%
 
270 1 0.1%
 
ValueCountFrequency (%) 
6110 1 0.1%
 
3206 1 0.1%
 
3200 1 0.1%
 
3138 1 0.1%
 
3094 1 0.1%
 

1stFlrSF
Real number (ℝ≥0)

Distinct count753
Unique (%)51.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1162.626712
Minimum334
Maximum4692
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum334
5-th percentile672.95
Q1882
median1087
Q31391.25
95-th percentile1831.25
Maximum4692
Range4358
Interquartile range (IQR)509.25

Descriptive statistics

Standard deviation386.587738
Coefficient of variation (CV)0.3325123481
Kurtosis5.745841482
Mean1162.626712
Median Absolute Deviation (MAD)300.5763089
Skewness1.376756622
Sum1697435
Variance149450.0792
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 334. 481.5 659. 762. 847.5 ... 1393. 1738. 2132.5 2578.5 4692. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
864 25 1.7%
 
1040 16 1.1%
 
912 14 1.0%
 
848 12 0.8%
 
894 12 0.8%
 
672 11 0.8%
 
816 9 0.6%
 
630 9 0.6%
 
936 7 0.5%
 
960 7 0.5%
 
Other values (743) 1338 91.6%
 
ValueCountFrequency (%) 
334 1 0.1%
 
372 1 0.1%
 
438 1 0.1%
 
480 1 0.1%
 
483 7 0.5%
 
ValueCountFrequency (%) 
4692 1 0.1%
 
3228 1 0.1%
 
3138 1 0.1%
 
2898 1 0.1%
 
2633 1 0.1%
 

2ndFlrSF
Real number (ℝ≥0)

ZEROS
Distinct count417
Unique (%)28.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean346.9924658
Minimum0
Maximum2065
Zeros829
Zeros (%)56.8%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3728
95-th percentile1141.05
Maximum2065
Range2065
Interquartile range (IQR)728

Descriptive statistics

Standard deviation436.5284359
Coefficient of variation (CV)1.258034335
Kurtosis-0.5534635576
Mean346.9924658
Median Absolute Deviation (MAD)396.4775493
Skewness0.8130298163
Sum506609
Variance190557.0753
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 55. 200. 431. 545.5 ... 728.5 916.5 1366. 1564.5 2065. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 829 56.8%
 
728 10 0.7%
 
504 9 0.6%
 
672 8 0.5%
 
546 8 0.5%
 
720 7 0.5%
 
600 7 0.5%
 
896 6 0.4%
 
780 5 0.3%
 
862 5 0.3%
 
Other values (407) 566 38.8%
 
ValueCountFrequency (%) 
0 829 56.8%
 
110 1 0.1%
 
167 1 0.1%
 
192 1 0.1%
 
208 1 0.1%
 
ValueCountFrequency (%) 
2065 1 0.1%
 
1872 1 0.1%
 
1818 1 0.1%
 
1796 1 0.1%
 
1611 1 0.1%
 

LowQualFinSF
Real number (ℝ≥0)

ZEROS
Distinct count24
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.844520548
Minimum0
Maximum572
Zeros1434
Zeros (%)98.2%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum572
Range572
Interquartile range (IQR)0

Descriptive statistics

Standard deviation48.62308143
Coefficient of variation (CV)8.319430317
Kurtosis83.23481667
Mean5.844520548
Median Absolute Deviation (MAD)11.48088009
Skewness9.011341288
Sum8533
Variance2364.204048
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 26.5 572. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1434 98.2%
 
80 3 0.2%
 
360 2 0.1%
 
528 1 0.1%
 
53 1 0.1%
 
120 1 0.1%
 
144 1 0.1%
 
156 1 0.1%
 
205 1 0.1%
 
232 1 0.1%
 
Other values (14) 14 1.0%
 
ValueCountFrequency (%) 
0 1434 98.2%
 
53 1 0.1%
 
80 3 0.2%
 
120 1 0.1%
 
144 1 0.1%
 
ValueCountFrequency (%) 
572 1 0.1%
 
528 1 0.1%
 
515 1 0.1%
 
514 1 0.1%
 
513 1 0.1%
 

GrLivArea
Real number (ℝ≥0)

Distinct count861
Unique (%)59.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1515.463699
Minimum334
Maximum5642
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum334
5-th percentile848
Q11129.5
median1464
Q31776.75
95-th percentile2466.1
Maximum5642
Range5308
Interquartile range (IQR)647.25

Descriptive statistics

Standard deviation525.4803834
Coefficient of variation (CV)0.3467456092
Kurtosis4.895120581
Mean1515.463699
Median Absolute Deviation (MAD)397.3249381
Skewness1.366560356
Sum2212577
Variance276129.6334
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 334. 610.5 765.5 862.5 865. ... 2137. 2650. 2885. 3617.5 5642. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
864 22 1.5%
 
1040 14 1.0%
 
894 11 0.8%
 
848 10 0.7%
 
1456 10 0.7%
 
912 9 0.6%
 
1200 9 0.6%
 
816 8 0.5%
 
1092 8 0.5%
 
1344 7 0.5%
 
Other values (851) 1352 92.6%
 
ValueCountFrequency (%) 
334 1 0.1%
 
438 1 0.1%
 
480 1 0.1%
 
520 1 0.1%
 
605 1 0.1%
 
ValueCountFrequency (%) 
5642 1 0.1%
 
4676 1 0.1%
 
4476 1 0.1%
 
4316 1 0.1%
 
3627 1 0.1%
 

BsmtFullBath
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
0
856
1
588
2
 
15
3
 
1
ValueCountFrequency (%) 
0 856 58.6%
 
1 588 40.3%
 
2 15 1.0%
 
3 1 0.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

BsmtHalfBath
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
0
1378
1
 
80
2
 
2
ValueCountFrequency (%) 
0 1378 94.4%
 
1 80 5.5%
 
2 2 0.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 3 100.0%
 
ValueCountFrequency (%) 
Common 3 100.0%
 
ValueCountFrequency (%) 
ASCII 3 100.0%
 

FullBath
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
2
768
1
650
3
 
33
0
 
9
ValueCountFrequency (%) 
2 768 52.6%
 
1 650 44.5%
 
3 33 2.3%
 
0 9 0.6%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

HalfBath
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
0
913
1
535
2
 
12
ValueCountFrequency (%) 
0 913 62.5%
 
1 535 36.6%
 
2 12 0.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 3 100.0%
 
ValueCountFrequency (%) 
Common 3 100.0%
 
ValueCountFrequency (%) 
ASCII 3 100.0%
 

BedroomAbvGr
Real number (ℝ≥0)

Distinct count8
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.866438356
Minimum0
Maximum8
Zeros6
Zeros (%)0.4%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile2
Q12
median3
Q33
95-th percentile4
Maximum8
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.8157780441
Coefficient of variation (CV)0.2845964025
Kurtosis2.230874582
Mean2.866438356
Median Absolute Deviation (MAD)0.576308876
Skewness0.2117900963
Sum4185
Variance0.6654938173
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 3.5 4.5 5.5 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 804 55.1%
 
2 358 24.5%
 
4 213 14.6%
 
1 50 3.4%
 
5 21 1.4%
 
6 7 0.5%
 
0 6 0.4%
 
8 1 0.1%
 
ValueCountFrequency (%) 
0 6 0.4%
 
1 50 3.4%
 
2 358 24.5%
 
3 804 55.1%
 
4 213 14.6%
 
ValueCountFrequency (%) 
8 1 0.1%
 
6 7 0.5%
 
5 21 1.4%
 
4 213 14.6%
 
3 804 55.1%
 

KitchenAbvGr
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
1
1392
2
 
65
3
 
2
0
 
1
ValueCountFrequency (%) 
1 1392 95.3%
 
2 65 4.5%
 
3 2 0.1%
 
0 1 0.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

TotRmsAbvGrd
Real number (ℝ≥0)

Distinct count12
Unique (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.517808219
Minimum2
Maximum14
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum2
5-th percentile4
Q15
median6
Q37
95-th percentile10
Maximum14
Range12
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.625393291
Coefficient of variation (CV)0.2493772808
Kurtosis0.8807615657
Mean6.517808219
Median Absolute Deviation (MAD)1.279594671
Skewness0.6763408364
Sum9516
Variance2.641903349
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 2.5 3.5 4.5 5.5 7.5 8.5 10.5 13. 14. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6 402 27.5%
 
7 329 22.5%
 
5 275 18.8%
 
8 187 12.8%
 
4 97 6.6%
 
9 75 5.1%
 
10 47 3.2%
 
11 18 1.2%
 
3 17 1.2%
 
12 11 0.8%
 
Other values (2) 2 0.1%
 
ValueCountFrequency (%) 
2 1 0.1%
 
3 17 1.2%
 
4 97 6.6%
 
5 275 18.8%
 
6 402 27.5%
 
ValueCountFrequency (%) 
14 1 0.1%
 
12 11 0.8%
 
11 18 1.2%
 
10 47 3.2%
 
9 75 5.1%
 

Fireplaces
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.5 KiB
0
690
1
650
2
 
115
3
 
5
ValueCountFrequency (%) 
0 690 47.3%
 
1 650 44.5%
 
2 115 7.9%
 
3 5 0.3%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

GarageCars
Real number (ℝ≥0)

ZEROS
Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.767123288
Minimum0
Maximum4
Zeros81
Zeros (%)5.5%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q32
95-th percentile3
Maximum4
Range4
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7473150101
Coefficient of variation (CV)0.4228991918
Kurtosis0.220997764
Mean1.767123288
Median Absolute Deviation (MAD)0.5838431225
Skewness-0.3425489297
Sum2580
Variance0.5584797243
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 824 56.4%
 
1 369 25.3%
 
3 181 12.4%
 
0 81 5.5%
 
4 5 0.3%
 
ValueCountFrequency (%) 
0 81 5.5%
 
1 369 25.3%
 
2 824 56.4%
 
3 181 12.4%
 
4 5 0.3%
 
ValueCountFrequency (%) 
4 5 0.3%
 
3 181 12.4%
 
2 824 56.4%
 
1 369 25.3%
 
0 81 5.5%
 

GarageArea
Real number (ℝ≥0)

ZEROS
Distinct count441
Unique (%)30.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean472.980137
Minimum0
Maximum1418
Zeros81
Zeros (%)5.5%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1334.5
median480
Q3576
95-th percentile850.1
Maximum1418
Range1418
Interquartile range (IQR)241.5

Descriptive statistics

Standard deviation213.8048415
Coefficient of variation (CV)0.452037675
Kurtosis0.9170672023
Mean472.980137
Median Absolute Deviation (MAD)160.0190646
Skewness0.1799809067
Sum690551
Variance45712.51023
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 80. 172. 237. 242. ... 671.5 672.5 910. 1061. 1418. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 81 5.5%
 
440 49 3.4%
 
576 47 3.2%
 
240 38 2.6%
 
484 34 2.3%
 
528 33 2.3%
 
288 27 1.8%
 
400 25 1.7%
 
480 24 1.6%
 
264 24 1.6%
 
Other values (431) 1078 73.8%
 
ValueCountFrequency (%) 
0 81 5.5%
 
160 2 0.1%
 
164 1 0.1%
 
180 9 0.6%
 
186 1 0.1%
 
ValueCountFrequency (%) 
1418 1 0.1%
 
1390 1 0.1%
 
1356 1 0.1%
 
1248 1 0.1%
 
1220 1 0.1%
 

WoodDeckSF
Real number (ℝ≥0)

ZEROS
Distinct count274
Unique (%)18.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean94.24452055
Minimum0
Maximum857
Zeros761
Zeros (%)52.1%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3168
95-th percentile335
Maximum857
Range857
Interquartile range (IQR)168

Descriptive statistics

Standard deviation125.3387944
Coefficient of variation (CV)1.329931901
Kurtosis2.992950925
Mean94.24452055
Median Absolute Deviation (MAD)101.9957947
Skewness1.541375757
Sum137597
Variance15709.81337
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 6. 25. 99. 101.5 ... 239.5 240.5 370.5 518. 857. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 761 52.1%
 
192 38 2.6%
 
100 36 2.5%
 
144 33 2.3%
 
120 31 2.1%
 
168 28 1.9%
 
140 15 1.0%
 
224 14 1.0%
 
240 10 0.7%
 
208 10 0.7%
 
Other values (264) 484 33.2%
 
ValueCountFrequency (%) 
0 761 52.1%
 
12 2 0.1%
 
24 2 0.1%
 
26 2 0.1%
 
28 2 0.1%
 
ValueCountFrequency (%) 
857 1 0.1%
 
736 1 0.1%
 
728 1 0.1%
 
670 1 0.1%
 
668 1 0.1%
 

OpenPorchSF
Real number (ℝ≥0)

ZEROS
Distinct count202
Unique (%)13.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.66027397
Minimum0
Maximum547
Zeros656
Zeros (%)44.9%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median25
Q368
95-th percentile175.05
Maximum547
Range547
Interquartile range (IQR)68

Descriptive statistics

Standard deviation66.25602768
Coefficient of variation (CV)1.419966538
Kurtosis8.490335806
Mean46.66027397
Median Absolute Deviation (MAD)47.67807844
Skewness2.36434174
Sum68124
Variance4389.861203
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 2. 15.5 19. 47.5 ... 76.5 122.5 171. 291.5 547. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 656 44.9%
 
36 29 2.0%
 
48 22 1.5%
 
20 21 1.4%
 
40 19 1.3%
 
45 19 1.3%
 
30 16 1.1%
 
24 16 1.1%
 
60 15 1.0%
 
39 14 1.0%
 
Other values (192) 633 43.4%
 
ValueCountFrequency (%) 
0 656 44.9%
 
4 1 0.1%
 
8 1 0.1%
 
10 1 0.1%
 
11 1 0.1%
 
ValueCountFrequency (%) 
547 1 0.1%
 
523 1 0.1%
 
502 1 0.1%
 
418 1 0.1%
 
406 1 0.1%
 

EnclosedPorch
Real number (ℝ≥0)

ZEROS
Distinct count120
Unique (%)8.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.95410959
Minimum0
Maximum552
Zeros1252
Zeros (%)85.8%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile180.15
Maximum552
Range552
Interquartile range (IQR)0

Descriptive statistics

Standard deviation61.1191486
Coefficient of variation (CV)2.783950237
Kurtosis10.43076594
Mean21.95410959
Median Absolute Deviation (MAD)37.65952524
Skewness3.089871904
Sum32053
Variance3735.550326
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 9.5 110. 113. 253. 297.5 552. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1252 85.8%
 
112 15 1.0%
 
96 6 0.4%
 
120 5 0.3%
 
144 5 0.3%
 
192 5 0.3%
 
216 5 0.3%
 
252 4 0.3%
 
116 4 0.3%
 
156 4 0.3%
 
Other values (110) 155 10.6%
 
ValueCountFrequency (%) 
0 1252 85.8%
 
19 1 0.1%
 
20 1 0.1%
 
24 1 0.1%
 
30 1 0.1%
 
ValueCountFrequency (%) 
552 1 0.1%
 
386 1 0.1%
 
330 1 0.1%
 
318 1 0.1%
 
301 1 0.1%
 

3SsnPorch
Real number (ℝ≥0)

ZEROS
Distinct count20
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.409589041
Minimum0
Maximum508
Zeros1436
Zeros (%)98.4%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum508
Range508
Interquartile range (IQR)0

Descriptive statistics

Standard deviation29.31733056
Coefficient of variation (CV)8.598493896
Kurtosis123.6623794
Mean3.409589041
Median Absolute Deviation (MAD)6.707082004
Skewness10.30434203
Sum4978
Variance859.505871
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 11.5 135. 189. 508. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1436 98.4%
 
168 3 0.2%
 
216 2 0.1%
 
144 2 0.1%
 
180 2 0.1%
 
245 1 0.1%
 
238 1 0.1%
 
290 1 0.1%
 
196 1 0.1%
 
182 1 0.1%
 
Other values (10) 10 0.7%
 
ValueCountFrequency (%) 
0 1436 98.4%
 
23 1 0.1%
 
96 1 0.1%
 
130 1 0.1%
 
140 1 0.1%
 
ValueCountFrequency (%) 
508 1 0.1%
 
407 1 0.1%
 
320 1 0.1%
 
304 1 0.1%
 
290 1 0.1%
 

ScreenPorch
Real number (ℝ≥0)

ZEROS
Distinct count76
Unique (%)5.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.0609589
Minimum0
Maximum480
Zeros1344
Zeros (%)92.1%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile160
Maximum480
Range480
Interquartile range (IQR)0

Descriptive statistics

Standard deviation55.75741528
Coefficient of variation (CV)3.70211589
Kurtosis18.43906784
Mean15.0609589
Median Absolute Deviation (MAD)27.72866954
Skewness4.122213743
Sum21989
Variance3108.889359
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 20. 117.5 224.5 289.5 480. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1344 92.1%
 
192 6 0.4%
 
224 5 0.3%
 
120 5 0.3%
 
189 4 0.3%
 
180 4 0.3%
 
160 3 0.2%
 
168 3 0.2%
 
144 3 0.2%
 
126 3 0.2%
 
Other values (66) 80 5.5%
 
ValueCountFrequency (%) 
0 1344 92.1%
 
40 1 0.1%
 
53 1 0.1%
 
60 1 0.1%
 
63 1 0.1%
 
ValueCountFrequency (%) 
480 1 0.1%
 
440 1 0.1%
 
410 1 0.1%
 
396 1 0.1%
 
385 1 0.1%
 

PoolArea
Real number (ℝ≥0)

ZEROS
Distinct count8
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.75890411
Minimum0
Maximum738
Zeros1453
Zeros (%)99.5%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum738
Range738
Interquartile range (IQR)0

Descriptive statistics

Standard deviation40.17730694
Coefficient of variation (CV)14.56277759
Kurtosis223.2684989
Mean2.75890411
Median Absolute Deviation (MAD)5.491352974
Skewness14.82837364
Sum4028
Variance1614.215993
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 240. 738.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1453 99.5%
 
738 1 0.1%
 
648 1 0.1%
 
576 1 0.1%
 
555 1 0.1%
 
519 1 0.1%
 
512 1 0.1%
 
480 1 0.1%
 
ValueCountFrequency (%) 
0 1453 99.5%
 
480 1 0.1%
 
512 1 0.1%
 
519 1 0.1%
 
555 1 0.1%
 
ValueCountFrequency (%) 
738 1 0.1%
 
648 1 0.1%
 
576 1 0.1%
 
555 1 0.1%
 
519 1 0.1%
 

MiscVal
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count21
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.4890411
Minimum0
Maximum15500
Zeros1408
Zeros (%)96.4%
Memory size11.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum15500
Range15500
Interquartile range (IQR)0

Descriptive statistics

Standard deviation496.1230245
Coefficient of variation (CV)11.408001
Kurtosis701.0033423
Mean43.4890411
Median Absolute Deviation (MAD)83.88023269
Skewness24.47679419
Sum63494
Variance246138.0554
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 27. 375. 530. 750. 2250. 15500.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1408 96.4%
 
400 11 0.8%
 
500 8 0.5%
 
700 5 0.3%
 
450 4 0.3%
 
2000 4 0.3%
 
600 4 0.3%
 
1200 2 0.1%
 
480 2 0.1%
 
1150 1 0.1%
 
Other values (11) 11 0.8%
 
ValueCountFrequency (%) 
0 1408 96.4%
 
54 1 0.1%
 
350 1 0.1%
 
400 11 0.8%
 
450 4 0.3%
 
ValueCountFrequency (%) 
15500 1 0.1%
 
8300 1 0.1%
 
3500 1 0.1%
 
2500 1 0.1%
 
2000 4 0.3%
 

MoSold
Real number (ℝ≥0)

Distinct count12
Unique (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.321917808
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum1
5-th percentile2
Q15
median6
Q38
95-th percentile11
Maximum12
Range11
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.703626208
Coefficient of variation (CV)0.4276591836
Kurtosis-0.4041093415
Mean6.321917808
Median Absolute Deviation (MAD)2.142522049
Skewness0.2120529851
Sum9230
Variance7.309594675
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 4.5 7.5 8.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6 253 17.3%
 
7 234 16.0%
 
5 204 14.0%
 
4 141 9.7%
 
8 122 8.4%
 
3 106 7.3%
 
10 89 6.1%
 
11 79 5.4%
 
9 63 4.3%
 
12 59 4.0%
 
Other values (2) 110 7.5%
 
ValueCountFrequency (%) 
1 58 4.0%
 
2 52 3.6%
 
3 106 7.3%
 
4 141 9.7%
 
5 204 14.0%
 
ValueCountFrequency (%) 
12 59 4.0%
 
11 79 5.4%
 
10 89 6.1%
 
9 63 4.3%
 
8 122 8.4%
 

YrSold
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.815753
Minimum2006
Maximum2010
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum2006
5-th percentile2006
Q12007
median2008
Q32009
95-th percentile2010
Maximum2010
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.328095121
Coefficient of variation (CV)0.0006614626458
Kurtosis-1.190600571
Mean2007.815753
Median Absolute Deviation (MAD)1.148670482
Skewness0.09626851387
Sum2931411
Variance1.763836649
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2006. 2006.5 2010. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2009 338 23.2%
 
2007 329 22.5%
 
2006 314 21.5%
 
2008 304 20.8%
 
2010 175 12.0%
 
ValueCountFrequency (%) 
2006 314 21.5%
 
2007 329 22.5%
 
2008 304 20.8%
 
2009 338 23.2%
 
2010 175 12.0%
 
ValueCountFrequency (%) 
2010 175 12.0%
 
2009 338 23.2%
 
2008 304 20.8%
 
2007 329 22.5%
 
2006 314 21.5%
 

SalePrice
Real number (ℝ≥0)

Distinct count663
Unique (%)45.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean180921.1959
Minimum34900
Maximum755000
Zeros0
Zeros (%)0.0%
Memory size11.5 KiB

Quantile statistics

Minimum34900
5-th percentile88000
Q1129975
median163000
Q3214000
95-th percentile326100
Maximum755000
Range720100
Interquartile range (IQR)84025

Descriptive statistics

Standard deviation79442.50288
Coefficient of variation (CV)0.4391000319
Kurtosis6.53628186
Mean180921.1959
Median Absolute Deviation (MAD)57434.77028
Skewness1.88287576
Sum264144946
Variance6311111264
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 34900. 78500. 104950. 134950. 135250. ... 240500. 280500. 340500. 443130.5 755000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
140000 20 1.4%
 
135000 17 1.2%
 
145000 14 1.0%
 
155000 14 1.0%
 
190000 13 0.9%
 
110000 13 0.9%
 
160000 12 0.8%
 
115000 12 0.8%
 
139000 11 0.8%
 
130000 11 0.8%
 
Other values (653) 1323 90.6%
 
ValueCountFrequency (%) 
34900 1 0.1%
 
35311 1 0.1%
 
37900 1 0.1%
 
39300 1 0.1%
 
40000 1 0.1%
 
ValueCountFrequency (%) 
755000 1 0.1%
 
745000 1 0.1%
 
625000 1 0.1%
 
611657 1 0.1%
 
582933 1 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

IdMSSubClassLotAreaOverallQualOverallCondYearBuiltYearRemodAddBsmtFinSF1BsmtFinSF2BsmtUnfSFTotalBsmtSF1stFlrSF2ndFlrSFLowQualFinSFGrLivAreaBsmtFullBathBsmtHalfBathFullBathHalfBathBedroomAbvGrKitchenAbvGrTotRmsAbvGrdFireplacesGarageCarsGarageAreaWoodDeckSFOpenPorchSFEnclosedPorch3SsnPorchScreenPorchPoolAreaMiscValMoSoldYrSoldSalePrice
0160845075200320037060150856856854017101021318025480610000022008208500
12209600681976197697802841262126200126201203161246029800000052007181500
23601125075200120024860434920920866017861021316126080420000092008223500
347095507519151970216054075696175601717101031713642035272000022006140000
45601426085200020006550490114511451053021981021419138361928400000122008250000
5650141155519931995732064796796566013621011115024804030032000700102009143000
67201008485200420051369031716861694001694102031712636255570000082007307000
7860103827619731973859322161107110798302090102131722484235204228000350112009200000
89506120751931195000952952102275201774002022822468900205000042008129900
9101907420561939195085101409911077001077101022521205040000012008118000

Last rows

IdMSSubClassLotAreaOverallQualOverallCondYearBuiltYearRemodAddBsmtFinSF1BsmtFinSF2BsmtUnfSFTotalBsmtSF1stFlrSF2ndFlrSFLowQualFinSFGrLivAreaBsmtFullBathBsmtHalfBathFullBathHalfBathBedroomAbvGrKitchenAbvGrTotRmsAbvGrdFireplacesGarageCarsGarageAreaWoodDeckSFOpenPorchSFEnclosedPorch3SsnPorchScreenPorchPoolAreaMiscValMoSoldYrSoldSalePrice
1450145190900055197419740089689689689601792002242800032450000092009136000
145114522092628520082009001573157315780015780020317138400360000052009287090
14521453180367555200520055470054710720010721010215025250280000052006145000
14531454201721755200620060011401140114000114000103160003656000007200684500
145414552075007520042005410081112211221001221102021602400011300000102009185000
14551456607917651999200000953953953694016470021317124600400000082007175000
14561457201317566197819887901635891542207300207310203172250034900000022010210000
1457145870904279194120062750877115211881152023400020419212520600000250052010266500
1458145920971756195019964910290107810780010781010215012403660112000042010142125
14591460209937561965196583029013612561256001256101131601276736680000062008147500